CDS

Accession Number TCMCG078C20671
gbkey CDS
Protein Id KAG0485744.1
Location join(42694777..42694797,42698106..42698156,42701761..42701856,42702096..42702208,42702319..42702385,42702474..42702566,42702657..42702800,42702960..42702997,42703068..42703173,42703263..42703350,42703470..42703588,42703878..42704051,42704136..42704332,42704446..42704534,42704629..42704738,42704814..42704924,42705011..42705115,42705223..42705434,42705510..42705792,42705888..42705939,42714554..42714633)
Organism Vanilla planifolia
locus_tag HPP92_009823

Protein

Length 782aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000004.1
Definition hypothetical protein HPP92_009823 [Vanilla planifolia]
Locus_tag HPP92_009823

EGGNOG-MAPPER Annotation

COG_category G
Description beta-galactosidase
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0004553        [VIEW IN EMBL-EBI]
GO:0004565        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005618        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005773        [VIEW IN EMBL-EBI]
GO:0015925        [VIEW IN EMBL-EBI]
GO:0016787        [VIEW IN EMBL-EBI]
GO:0016798        [VIEW IN EMBL-EBI]
GO:0030312        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0071944        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGAAGGTGACAAAGCGAAAGGATATTATTGAAAATCTTTTTGAATTAGTTTTTAAGAAGATTGGTAAAGAGATGTGGTCTGATCTCATTCACAAAGCAAAAGAAGGAGGTCTTGATGCCATTGAAACCTATGTTTTCTGGAATTCTCATGAGCCTCGTCGACGCAAGTATGACTTTGAAGGAAACCATGATCTGATTAGATTCATCAAAGAAATACAAAATGCTGGTCTTTATGCAATTCTTCGAATTGGTCCATATGCTTGTGCTGAGTGGAATTATGGAGGATTTCCTGCTTGGCTGCGCCAAGTTCCTGGGTTGCAAATGAGGACTAACAATCAACCATTTAAGGATGAAATGCAAAACTTTACTACATTAATTGTTAACATGGTGAAGAAAGAAAGGTTATTTGCACCACAAGGAGGTCCTGTCATCTTAGCCCAGATTGAGAATGAATATGGAAATATCCAATGGGAATATGGTGATGCCGGCAAACAATATGTTCAATGGTGTGCAAAAATGGCAGACTCTCTCAACATTGGAGTTCCCTGGATCATGTGTCAACAATCAGATGCACCACAACCAATGCCCGAAGATTTTCACAGAGAACTGGACAGGATGGTGAGGTTCAAGGCTTGGGATAAGCCAGACCCTCACAGACTTGTAGAAGATTTGGCTTACTCTGTAGCTCATTTCTTTCAATCCAATGGGACACTCATTAACTATTACATGTATCATGGAGGAACAAATTTTGGTCGAACATCTGGAGGACCGTACATTACAACTTCATATGATTATGATGCTCCTTTAGATGAATATGGAAATAAAAGGCAACCCAAGTGGGGGCATTTGAAAGAACTCCATATTGTGATCAAGACAATGGAAAAGGCACTCACTTATGGTGATCGACATTATACTAATCTTGGAAACGGGTTATCTGTGACAAAGTTCTTCGGCGACAGAATGACTCCATCATGCTTTTTGATGAATGAAAACAGTTCAGCTGACGCCAATATTGATTACGAGGGGAATCAGTATTTTCTTCCTGCTTGGTCTATTAGCATTCTTCCAGATTGCAAGGAAGAAGCCTACAACACAGCTAAGGCAACTCTTGTTAATGTTCAAACATCCATCATGGTCAAGAAACCTAATGCAGCCGAAAAAGAGCCATCAAACTTGGTGTGGTCGTGGAGACCCGAGACTTTAAGGATGTCTCTTAATGGTTTGGGTGGATCTTTTACGTCAAACAAGCTTCTGGAGCAGATATCTACAAGTGCTGATCAGAGTGACTACATGTGGTACATGACAAGTGTGGATGTCGCCAATGAGGAGAAAATGACCCTTCATGTAAACACAACTGGTCATGTCCTTTATGCCTTTGTGAATGGAAGGCGTATTGGATCCCAATTTGCTCCTAATGGTGGATTTAGATTTGTGTTTGAAAAGGTGGCTACAATGAAACCAGGAAAGAATTACATCTCTTTACTCAGCGCCACAGTTGGACTCAAGAACTATGGTGCACACTATGAGCTAATGCCGGCTGGAATTGTGGATGGCCCGGTTCAATTAATCAGAGAACAAGGAGTGTTGGATCTCTCATCTAATGAATGGTCTTACAAGATTGGGCTTGATGGGTGGGAGAAAAAACTTTACCTGAAAAATTCTACAGCATATAAATGGCGCTATGGCATTATTCAAACCAGAAGACCCTTTACTTGGTACAAGACAACCTTCAAGGCTCCTCTGGGTTCTGAACCTGTGGTGGTTGATCTCCTCGGCATGGGCAAAGGAGAAGCTTGGGTGAACGGTCAAAGTCTAGGCCGATTTTGGCCGAGCTACATAGCCAACCCCGACGGTTGCAAGCAGGTATGCGACTACAGAGGCATGTATAAGGACGACAGCTGCCTTACCGGCTGCGGAGAGCCTTCTCAGAGATGGTACCACGTCCCCAGATCGTTCCTGAAGACGTCTGAACCAAACACATTGGTCTTGTTTGAAGAGGCCGGCGGCGATCCAATAGACGTGAACTTCCTTACAGTCACAGTAGGCAAGGCATGCGCAAGCGTGGCCGAGGGGAAGACCATGACCCTCTCATGCCAAGGAGACCAAACAATATCTTCCATTGAGTTTGCTAGTTTTGGAGATCCCACAGGAACTTGTGGCTTCTTCAAGAGAGGCTCTTGTGAAGCTACGGAGACCCTTTTGGCTGTTGAGAAGGTGGCATGTATTGGGCAAGCATCATGCTCGATTGAGGTCAATGAGGAGATTCTCGACATCACTTTTGTTGGGAATCAGGACCATTGCACTGCGAAAGAAGAAGTGATAGAAAAGGTGGAAGCTTCTCAGGAATAG
Protein:  
MKVTKRKDIIENLFELVFKKIGKEMWSDLIHKAKEGGLDAIETYVFWNSHEPRRRKYDFEGNHDLIRFIKEIQNAGLYAILRIGPYACAEWNYGGFPAWLRQVPGLQMRTNNQPFKDEMQNFTTLIVNMVKKERLFAPQGGPVILAQIENEYGNIQWEYGDAGKQYVQWCAKMADSLNIGVPWIMCQQSDAPQPMPEDFHRELDRMVRFKAWDKPDPHRLVEDLAYSVAHFFQSNGTLINYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNKRQPKWGHLKELHIVIKTMEKALTYGDRHYTNLGNGLSVTKFFGDRMTPSCFLMNENSSADANIDYEGNQYFLPAWSISILPDCKEEAYNTAKATLVNVQTSIMVKKPNAAEKEPSNLVWSWRPETLRMSLNGLGGSFTSNKLLEQISTSADQSDYMWYMTSVDVANEEKMTLHVNTTGHVLYAFVNGRRIGSQFAPNGGFRFVFEKVATMKPGKNYISLLSATVGLKNYGAHYELMPAGIVDGPVQLIREQGVLDLSSNEWSYKIGLDGWEKKLYLKNSTAYKWRYGIIQTRRPFTWYKTTFKAPLGSEPVVVDLLGMGKGEAWVNGQSLGRFWPSYIANPDGCKQVCDYRGMYKDDSCLTGCGEPSQRWYHVPRSFLKTSEPNTLVLFEEAGGDPIDVNFLTVTVGKACASVAEGKTMTLSCQGDQTISSIEFASFGDPTGTCGFFKRGSCEATETLLAVEKVACIGQASCSIEVNEEILDITFVGNQDHCTAKEEVIEKVEASQE